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ABSTRACT 

This project work presents approach for cardiac arrhythmia disease classification. The GA-SVM method 
combines both Support Vector Machine (SVM) and Genetic Algorithm approaches. First, twenty two features from 
electrocardiogram signal are extracted. ECG feature extraction plays a significant role in diagnosing most of the cardiac 
diseases. One cardiac cycle in an ECG signal consists of the P-QRS-T waves. This feature extraction scheme determines 
the amplitudes and intervals in the ECG signal for subsequent analysis. These features are obtained semi automatically 
from time-voltage of R, S, T, P, Q features of an Electro Cardiogram signals. The amplitudes and intervals value of P- 
QRS-T segment determines the functioning of heart of every human. Recently, numerous research and techniques have 
been developed for analyzing the ECG signal. 

We used genetic algorithm for ECG arrhythmia classification. It is used to improve the generalization 
performance of the SVM classifier. In order to do this, the design of the SVM classifier is optimized by searching for the 
best value of the parameters that tune its discriminate function, and looking for the best subset of features that optimizes 
the classification fitness function. This system is automatically select a proper subset of features for optimizing the SVM 
classifier. We used MAT lab program for this method. After the running program, we get genetic algorithm - support 
vector method graph for ECG arrhythmia classification. 
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I .INTRODUCTION 

Classification of electrocardiograms (ECG) into different disease categories is a complex pattern 
recognition task. However, the analysis of electrocardiogram signals is the most effective available method for 
diagnosing cardiac arrhythmias. Computer-based classification of ECGs can provide high accuracy and offer a 
potential of an affordable cardiac abnormalities mass screening. Successful classification is achieved by finding the 
characteristic shapes of the ECG that discriminate effectively between the required diagnostic Categories. [ 1 ] A 
typical heart beat is identified from the ECG and the component waves of the QRS, T, and possibly P waves are 
characterized using measurements such as magnitude and duration. Datasets used for training and test of automated 
classification of ECG signals include many different features. However, one of the most popular and useful 
databases is the MIT-BIH. We used this database to various algorithms for arrhythmia detection and classification. 
Several methods have been proposed for the classification of ECG signals. [4] The arrhythmia of the heart is 
determined by the shape of the ECG waveform, which contains important information to different types of disease 
arises in the heart. However, the ECG signals are not regular in nature and it arises randomly at different time 
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intervals during a day. Sensor data mainly contain noise and distortion and hence before processing these data, the sensor 
electrocardiogram (ECG) data we need to remove as much noise, distortion present. So proper analyzing the interval 
between waveforms is needed [2]. 

Support vector machine and its types is described in section II. In section III, feature extraction and selection is 
described. Also the dataset is described in detail in this section. In section IV, feature reduction is described. The GA-SVM 
algorithm is described in section V. Experimental result is given in section VI. 

II. SUPPORT VECTOR MACHINE CLASSIFICATION 

In this section, a brief description of Support Vector Machines is mentioned. (SVMs) are very popular and 
powerful in pattern learning because of supporting high dimensional data and, at the same time, providing good 
generalization properties. SVMs have many usages in pattern recognition. At the beginning, SVM was formulated for two- 
class (binary) classification problems [6]. 

A. Binary Support Vector Machine Formulation 

Let X= { (x ; , y ; ) } n i=1 be a set of n training samples, where x, € R m is an m-dimensional sample in the input space, 
and y ; € { -1,1 }is the class label of sample x,. SVM finds the optimal separating hyper plane (OSH) with the minimal 
classification errors. The linear separation hyper plane is in the form of 

f(x) = w T x + b, 

Where w and h are the weight vector and bias, respectively [5]. 

B. Multi Class Support Vector Machine 

SVMs are usually binary classifiers, but, the classification of ECG signals mostly involves more than two classes. 
So, for this purpose, a number of multiclass classification strategies can be adopted. The most popular ones are the one- 
against-all (OAA) and the one-against-one (OAO) strategies. We use OAO for ECG multi class classification. The one 
against one constructs (n (n-l)/2)decision functions for all the combinations of class pairs[6]. 

III. FEATURE EXTRACTION AND SELECTION 

In this section we will explain the characteristics of the extracted feature from the ECG signals and the procedure 
designed for the extraction. Figure 1, presents the block diagram of the proposed arrhythmia classification. Details of each 
stage are explained later. 



Figure 1: Block Diagram of Proposed Arrhythmia Classification [2] 


A. Dataset Description 

In our experiment we are taking the ECG data as the basic signal for classification. In recent researches, the 
annotated ECG records, available at the MIT-BIH arrhythmia database, have been widely used for the evaluation of the 
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performance of different classifiers. The database has 48 records with each record being an ECG signal for the duration of 
30 minutes. Each data was recorded in two channels, modified limb lead II and modified lead VI. The database contains 
approximately 109,000 beat labels. ECG signals MIT-BIH Database are described by a text header file (.hea), a binary file 
(.dat) and a binary annotation file (.atr) [6]. 

In particular, the considered beat types refer to following classes: normal sinus rhythm (N), right bundle branch 
block(RB), left bundle branch block (LB), and paced beat (P). The beats were selected from the recording of following 
patients, which is shown in table 1 . 


Table 1: Data Set Descriptions and Numbers Used in the Simulation 


Cl. No. 

Record Ex. 

N 

L 

R 

A 

V 

F 

P 

F 

1 

100 

2239 



33 


1 



2 

105 

2526 




41 




3 

107 






59 



4 

217 

244 





162 

1542 

260 

5 

111 


2123 



1 




6 

212 

923 


1825 







B. Noise Reduction 


In the first stage of feature extraction, a wavelet transformation is performed in order to reduce noises. The 
wavelet transform allows processing non-stationary signals such as ECG signal. This is possible. Figure shows the ECG 
signal after reducing noises. We use fifth level of wavelet [12]. 


Absolute of Values of Ca.b for a = 1 2 3 4 5 ... 



50 100 150 200 

Time (or Space) b 


Figure 2: Input Signal after Using Wavelet Transform 
(First Level) 



Figure 3: Input Signal after Using Wavelet Transform 
(Fifth Level) 
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C. Feature Description 

For each signal nineteen temporal features such as R-R interval, PQ interval, PR interval, and PT interval and 
three morphological features are recognized. These features are manually extracted for each beat and put into a separate 
vector. Each vector is tagged with one the four possible labels N, P, LB, RB [6]. 

Features have been extracted including the time and voltage of Q/R/S/T/P and time interval for each of 5 features 
from the next feature such as RS/ ST/ QR as a mentioned in figure 4 and also the difference of voltage in these features 
such as V (Q)-V(S). The three morphological features by computing the maximum and the minimum values of a beat in 
ECG signal. Signals of each beat are scaled, using the following formula, such that the range of every signal is between 
zero and one [12], 

F (t) = f(f) - minft) 

Max (t)-min (t) 

The minimum and maximum voltages between the first and the second R feature is computer first and the 
normalization action is performed [0 1]. As mentioned before, we considered percent that are higher than 0.2, 0.5 and 0.8 
as three features. Six of the 22 features, called basic features, are: Rl, S, T, P, Q, R2 and the rest are called derived 
features. The derived features are calculated using the basic features via a semiautomatic procedure. 



Figure 4: Sample Features, ST Interval, TP Interval and RR Interval [6] 
The features which are used in the simulation are described in table 2. 

Table 2: Features Descriptions Used in the Simulation 


Feature No. 

Description 

01 

X(R1) 

02 

V(Rl) 

03 

X(S) 

04 

V(S) 

05 

X(T) 

06 

V(T) 

07 

X(P) 

08 

V(P) 

09 

X(Q) 

10 

V(Q) 

11 

X(R2) 

12 

V (R2) 

13 

X(R2) - X(R1) 

14 

V(R2) - V(R1) 

15 

X(S)-X(R1) 

16 

X(T) - X(S) 

17 

X(P) - X (T) 

18 

X(Q) - X(P) 

19 

X(R2)-X(Q) 


Impact Factor (JCC): 6.2879 


NAAS Rating: 2.40 


Wavelet Based ECG Arrhythmia Classification using GA-SVM Method 


49 


IV. FEATURE REDUCTION 

Many researchers said that not all the features are useful for classification. Some features may act as noises and, 
hence, reduce the classification accuracy. In this paper, feature reduction approaches are adopted. The studied showed that 
a meta-heuristic -based approach has better performance for classification of ECG arrhythmia than statistical method. 

A. Genetic Algorithm 

Genetic Algorithm is one prospective option for feature reduction. It can solve every optimization problem which 
can be described with the chromosome encoding. It solves problems with multiple solutions. The details of our 
implementation of GA are described as follows: 

Algorithm: Genetic Algorithm 

Input: Training Data 

Output: Useful Features 

StepO: initialize parameters (e.g. population size, crossover Rate, mutation rate and the maximum number of 
population 

generation.) 

Stepl: create initial population randomly (P (0)). 

Step2: evaluate current population (compute fitness of all 
chromosomes). 

Step3: while (termination condition not satisfied) do [step 

4-8] 

Step4: select P(t) from P( t+1 ) [perform selection] 

Step5: recombine P (t) [perform mutation and crossover] 

Step6: evaluate current population (compute fitness of all 
Chromosomes). 

Step7: t = t + 1 
Step8: go to Step 3 

V. THE PROPOSED GENETIC - SVM 

In this section, the proposed Wavelet based Genetic-SVM system for the classification of ECG signals is 
described. The aim of this system is to reduce noise from incoming signal and automatically select a proper subset of 
features for optimizing the SVM classifier. The upcoming figure shows the procedure followed. 

A. Genetic Set Up: 

The procedure of SVM classification system is as follows: 
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Step 1) Randomly generates an initial population of size 50. 

Step 2) For each chromosomes of the population, train P(0) SVM Classifiers. 

Step 3) Using OAO (multi -class SVM), compute the fitness of each chromosome (subset of features). 

Step 4) Directly Select some individuals from the current population based on the fitness values and regenerate 
new individuals from old ones. 

Step 5) If the maximum number of iteration is not yet reached, return to step 2. 

Steps 6) Select the chromosome with the best fitness value as the desired subset of features. 

Step 7) Classify the ECG Signals with the trained SVM 

Sensitivity refers to the rate of correctly classified positive. Sensitivity may be referred as a True Positive Rate. 
Sensitivity should be high for a classifier 

Sensitivity = TP / (TP+FN) 

True Positive (TP): arrhythmia detection coincides with decision of physician 
False Negative (FN): system labels an arrhythmia as healthy 



Figure 5: The Wavelet Based Genetic -SVM Approach [2] 

VI. EXPERIMENTAL RESULT 

For the evaluation of the proposed method, 50% of all the data of the MIT-BIH dataset are used for training the 
composed system and the rest are used for the evolution. The genetic-SVM classifier is used to the best subset of features 
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that can optimize the SVM classifier. Wavelet transform reduces noise signal and helps to improve result. The results of 
this experiment are presented in the Table 3, 4 and 6. The graph of different approaches is mentioned bellow figure 6, 7 
and 8. The comparison of parameters of all methods is shown in table 6. 


Table 3: The Arrhythmia Classification Result using GA-SVM Method 


P,LR 

P,LL 

P,N 

LR,LL 

LR,N 

LL,N 

Overall 

89.90 

95.11 

100 

95.56 

100 

98.01 

91.13 


GA-SVM 



Figure 6: Result of GA- SVM Method 
Table 4: The Arrhythmia Classification Result Using GA-SVM Method 


P,LR 

P,LL 

P,N 

LR,LL 

LR,N 

LL,N 

Overall 

89.90 

98.11 

90.01 

93.11 

84.01 

85.01 

93.13 



Figure 7: Result of Wavelet Based GA-SVM Method (First Level) 
Table 5: The Arrhythmia Classification Result Using GA-SVM Method 


P,LR 

P,LL 

P,N 

LR,LL 

LR,N 

LL,N 

Overall 

89.90 

99 

90 

92.11 

84.01 

85.01 

95.12 



Figure 8: Result of Wavelet Based GA-SVM Method (Fifth Level) 


www.tjprc.ors 


editor@tjprc. org 


52 


Raorane Shweta Suresh& Jadhav Sharad P 


Table 6: Comparisons of Parameters of Different Methods 


Parameter 

SVM 

SVM-GA 

Wavelet-SVM 

Wavelet-SVM-GA 

Overall sensitivity 

0.8812 

0.9113 

0.8913 

0.9512 

SVM sensitivity 

0.8805 

0.9201 

0.9311 

0.9302 

Executing time 

7.5s 

35.14s 

40.1s 

42.12s 

Efficiency 

0.7502 

0.7812 

0.8012 

0.9021 

Quality factor 

0.92 

0.93 

0.94 

0.96 


VII. CONCLUSIONS AND FUTURE WORK 

In this paper, a new method based on support vector machines for feature selection of electrocardiogram signals is 
proposed. Experimental results show that feature selection greatly improves the quality of classification. This is because 
some features may act as noises and negatively affect the outcome. A flexible and effective GA-SVM method for 
classification of ECG signal was presented and showed to be effective on MIT-BIH dataset to recognize arrhythmia 
classes. For future work, diseases of ECG can be identified. 
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